ON THE CANDIDATE PROBLEM WITH A RANDOM NUMBER OF CANDIDATES

by

C. Derman, Columbia University
G.J. Lieberman, Stanford University
S.M. Ross, University of California, Berkeley


1.  Introduction 

In the problem under consideration a decision maker has a
total of $M$ candidates to interview sequentially. The decision
maker must either accept or reject the candidate being interviewed
after he has been ranked with respect to his predecessors. Once
rejected, a candidate cannot be reconsidered; once a candidate is
accepted, no further interviews are carried out. The objective is
to select the candidate in such a way as to maximize the probability
of choosing the best of all $M$ candidates (assuming every ordering
of interviews is equally likely).

In the classical version, $M = m$ is fixed and the optimal
selection policy is to interview $r - 1$ candidates without choosing
any, then select the first leading candidate interviewed thereafter.
Here

$$ r = r(m) = \min\Bigl\{ r \ge 1 : \sum_{k=r}^{m-1} \frac{1}{k} \le 1 \Bigr\}. $$

(See Chow, et al. [1] p. 51 or Derman [3] p. 118.)
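The cutoff $r(m)$ can be evaluated directly from its definition. The following is a minimal sketch using exact rational arithmetic; the function name `classical_cutoff` is our own and does not appear in the cited references.

```python
from fractions import Fraction


def classical_cutoff(m: int) -> int:
    """Smallest r >= 1 with sum_{k=r}^{m-1} 1/k <= 1 (an empty sum is 0)."""
    if m < 1:
        raise ValueError("m must be a positive integer")
    # Start with the full sum for r = 1, i.e. sum_{k=1}^{m-1} 1/k.
    tail = sum((Fraction(1, k) for k in range(1, m)), Fraction(0))
    r = 1
    while tail > 1:
        tail -= Fraction(1, r)  # drop the term 1/r, leaving sum_{k=r+1}^{m-1} 1/k
        r += 1
    return r
```

For example, `classical_cutoff(10)` returns 4: with ten candidates the rule lets the first three go by and then takes the first leader.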

Rasmussen [5] and Rasmussen and Robbins [6] generalize the
problem to the case where $M$ is random with known probability function
$P\{M = m\} = p_m$, $m = 1, \ldots, N$, $N < \infty$. The approach taken
in [6] rests on the presumption (asserted in [5]) that the optimal
solution policy possesses the above simple form and that one need only
find the value of $r$ that maximizes

$$ \varphi(r) = \begin{cases} \displaystyle\sum_{m=1}^{N} \frac{p_m}{m}, & \text{if } r = 1, \\[2ex] \displaystyle (r-1) \sum_{m=r}^{N} \frac{p_m}{m} \sum_{k=r-1}^{m-1} \frac{1}{k}, & \text{if } r \ge 2, \end{cases} $$

the a priori probability of selecting the best candidate under the
rule which lets the first $r - 1$ go by and then selects the first
leader that occurs thereafter.
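To make the definition of $\varphi(r)$ concrete, here is a small sketch that evaluates it for an arbitrary probability function. The name `phi` and the convention that `p[m-1]` holds $p_m$ are our own illustrative choices.

```python
from fractions import Fraction


def phi(r, p):
    """A priori probability of selecting the best candidate under the rule
    'let the first r-1 go by, then take the first leader'.

    p[m-1] is P{M = m} for m = 1, ..., N, where N = len(p).
    """
    N = len(p)
    if not 1 <= r <= N:
        raise ValueError("r must satisfy 1 <= r <= N")
    if r == 1:
        return sum((Fraction(p[m - 1]) / m for m in range(1, N + 1)), Fraction(0))
    total = Fraction(0)
    for m in range(r, N + 1):
        # inner sum over k = r-1, ..., m-1
        inner = sum((Fraction(1, k) for k in range(r - 1, m)), Fraction(0))
        total += Fraction(p[m - 1]) / m * inner
    return (r - 1) * total
```

With all mass at $M = 3$, that is `p = [0, 0, 1]`, the function gives 1/3, 1/2 and 1/3 for $r = 1, 2, 3$, in agreement with the classical cutoff $r(3) = 2$.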

The assertion of [5], however, is not true for every probability
function $\{p_m\}$. In Section 2 a counter-example shows this. In Section
3 a sufficient condition is given for the optimal policy to have the
simple form. The device for generating the condition is the "one-step
look-ahead" criterion given by Derman and Sacks [4] (also by Chow and
Robbins [2]). It turns out that this criterion applies if and only if
$\varphi(r)$ is unimodal. The condition given here for the criterion
to apply improves on the one given in [6] for $\varphi(r)$ to be unimodal.

2.  Counter-example


If $\{p_m\}$ were a two-point distribution with large probability
mass at $M = 2$ (say) and small, but positive, probability mass at
$M = N$ ($N$ large), a simple selection policy would have $r = 1$ or $2$.
Intuitively, however, it would seem that a better policy would be to
select the second candidate if he was better than the first and, if
not, to let a number of candidates go by before deciding to choose
the next leading candidate. Formally, suppose

$$ p_2 = 1 - \varepsilon, \qquad p_N = \varepsilon. $$


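Then

$$ \varphi(1) = \frac{1-\varepsilon}{2} + \frac{\varepsilon}{N}, $$

$$ \varphi(2) = \frac{1-\varepsilon}{2} + \frac{\varepsilon}{N} \sum_{k=1}^{N-1} \frac{1}{k}, $$

$$ \varphi(r) < \varepsilon, \qquad 3 \le r \le N. $$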
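These three expressions are easy to tabulate. The sketch below does so for the two-point distribution; the values $\varepsilon = 1/100$ and $N = 50$ are illustrative choices of ours, and `phi_two_point` is a hypothetical helper name.

```python
from fractions import Fraction


def harmonic(a, b):
    """sum_{k=a}^{b} 1/k, with an empty sum equal to 0."""
    return sum((Fraction(1, k) for k in range(a, b + 1)), Fraction(0))


def phi_two_point(r, eps, N):
    """phi(r) when P{M = 2} = 1 - eps and P{M = N} = eps, with N >= 3."""
    if r == 1:
        return (1 - eps) / 2 + eps / N
    if r == 2:
        return (1 - eps) / 2 + eps / N * harmonic(1, N - 1)
    # For r >= 3 only the mass at M = N contributes.
    return (r - 1) * eps / N * harmonic(r - 1, N - 1)


eps, N = Fraction(1, 100), 50  # illustrative values, not from the paper
values = {r: phi_two_point(r, eps, N) for r in range(1, N + 1)}
best_r = max(values, key=values.get)
```

For these values the maximizing cutoff among the simple rules is `best_r == 2`, and every `values[r]` with `r >= 3` lies below `eps`.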


So for $N > 2$ and $\varepsilon$ small

$$ \varphi(2) = \max_{r} \varphi(r). $$

However, consider the policy that stops at $r = 2$, if possible,
or, if $M = N$, at the first leading candidate after $i - 1$ candidates
have been interviewed, where $i$ shall be appropriately determined.
Let $\varphi'$ denote the probability of selecting the best candidate
under this policy. Then,


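$$ \begin{aligned} \varphi' &= P\{M = 2\}\, P\{\text{Selecting Best Candidate} \mid M = 2\} + P\{M = N\}\, P\{\text{Selecting Best Candidate} \mid M = N\} \\ &= (1-\varepsilon) \cdot \frac{1}{2} + \varepsilon \Bigl[ \frac{1}{2} \cdot \frac{2}{N} + \frac{1}{2} \cdot \frac{i-1}{N} \sum_{k=i-1}^{N-1} \frac{1}{k} \Bigr] \\ &= \frac{1-\varepsilon}{2} + \frac{\varepsilon}{N} \Bigl[ 1 + \frac{i-1}{2} \sum_{k=i-1}^{N-1} \frac{1}{k} \Bigr]. \end{aligned} $$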


Thus, $\varphi' > \varphi(2)$ if

$$ 1 + \frac{i-1}{2} \sum_{k=i-1}^{N-1} \frac{1}{k} > \sum_{k=1}^{N-1} \frac{1}{k}. $$


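If $i$ is any integer greater than 3 and $N$ is sufficiently large
the latter inequality will hold, since the left side grows like
$\frac{i-1}{2} \log N$ while the right side grows like $\log N$.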
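The comparison can be checked numerically, and the conditional success probability given $M = N$ can be confirmed by enumerating all arrival orders for a small $N$. This is a sketch under our own naming (`phi_prime`, `phi_2`, `success_given_N`); the cutoff $i = 4$ and the small values of $N$ are illustrative choices.

```python
from fractions import Fraction
from itertools import permutations


def phi_prime(i, eps, N):
    """Success probability of the modified policy with second-stage cutoff i >= 3."""
    tail = sum((Fraction(1, k) for k in range(i - 1, N)), Fraction(0))
    return (1 - eps) / 2 + eps / N * (1 + Fraction(i - 1, 2) * tail)


def phi_2(eps, N):
    """phi(2) for the two-point distribution p_2 = 1 - eps, p_N = eps."""
    return (1 - eps) / 2 + eps / N * sum((Fraction(1, k) for k in range(1, N)), Fraction(0))


def success_given_N(i, N):
    """Exact P{select best | M = N} for the modified policy, by enumeration.

    ranks[t] is the absolute rank of candidate t+1 (1 = best).
    """
    wins = total = 0
    for ranks in permutations(range(1, N + 1)):
        total += 1
        if ranks[1] < ranks[0]:           # candidate 2 is a leader: stop there
            wins += ranks[1] == 1
            continue
        best_so_far = min(ranks[:i - 1])  # let candidates 1, ..., i-1 go by
        for t in range(i - 1, N):         # candidates i, ..., N
            if ranks[t] < best_so_far:    # first leader after the first i-1
                wins += ranks[t] == 1
                break
    return Fraction(wins, total)
```

With $i = 4$ the modified policy is worse than $\varphi(2)$ at $N = 7$ but better from $N = 8$ onward, which illustrates the "sufficiently large" qualifier.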


3.  One-step Look Ahead Policies

Consistent with our use of the expression in the foregoing, we
say a leading candidate is being interviewed if he ranks above all
previously interviewed candidates. A one-step look ahead policy is one
that accepts a leading candidate if and only if the probability that he
is the best exceeds the probability that one more leader will be
interviewed and that he will be the best. Applying a theorem by Derman and
Sacks [4] (also Chow and Robbins [2]), the one-step look ahead criterion
yields an optimal selection policy if there is an $r \ge 1$ such that
one-step look ahead accepts (rejects) a leader among the first $j$
interviewed if $j > r - 1$ ($j \le r - 1$).

Let

$$ p(m \mid i) = \frac{p_m}{\sum_{n=i}^{N} p_n}, \qquad m \ge i, $$

denote the conditional probability that $M = m$ given $M \ge i$. Let

    $A_j$ denote the event that the $j$th candidate interviewed is a leader,
    $B_j$ denote the event that the $j$th candidate is the best.

Then

$$ P\{B_j \mid M \ge j,\ A_j\} = j \sum_{m=j}^{N} \frac{p(m \mid j)}{m} = R(j) \quad \text{(say)}. $$

Let $C_j$ denote the event that the $j$th candidate observed is a leader,
one more leader will be observed, and that later leader will be the best.
Then

$$ \begin{aligned} P\{C_j \mid M \ge j,\ A_j\} &= \sum_{m=j+1}^{N} p(m \mid j) \sum_{k=j+1}^{m} \frac{j}{k(k-1)} \cdot \frac{k}{m} \\ &= j \sum_{m=j+1}^{N} \frac{p(m \mid j)}{m} \sum_{k=j}^{m-1} \frac{1}{k} \\ &= S(j) \quad \text{(say)}. \end{aligned} $$
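The two quantities compared by the one-step look ahead rule can be computed side by side. The following sketch assumes $P\{M \ge j\} > 0$ and uses our own convention that `p[m-1]` holds $p_m$; the function name `R_and_S` is hypothetical.

```python
from fractions import Fraction


def R_and_S(j, p):
    """Return (R(j), S(j)) for p[m-1] = P{M = m}, m = 1, ..., N.

    R(j): probability that a leader at position j is the best.
    S(j): probability that one more leader appears and is the best.
    Assumes P{M >= j} > 0.
    """
    N = len(p)
    tail_mass = sum((Fraction(p[n - 1]) for n in range(j, N + 1)), Fraction(0))

    def cond(m):
        return Fraction(p[m - 1]) / tail_mass  # p(m | j)

    R = j * sum((cond(m) / m for m in range(j, N + 1)), Fraction(0))
    S = j * sum(
        (cond(m) / m * sum((Fraction(1, k) for k in range(j, m)), Fraction(0))
         for m in range(j + 1, N + 1)),
        Fraction(0),
    )
    return R, S
```

For the fixed case $M = 3$ (`p = [0, 0, 1]`) this gives $R(1) = 1/3 < S(1) = 1/2$, so a leader at $j = 1$ is rejected, and $R(2) = 2/3 > S(2) = 1/3$, so a leader at $j = 2$ is accepted.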


Thus, the one-step look ahead policy is optimal if for some $r \ge 1$

$$ R(j) - S(j) \begin{cases} \le 0, & j \le r - 1, \\ \ge 0, & j \ge r. \end{cases} \tag{1} $$


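Now, interpreting $\sum_{k=j}^{j-1} 1/k = 0$,

$$ R(j) - S(j) = \frac{j}{\sum_{n=j}^{N} p_n} \Bigl[ \sum_{m=j}^{N} \frac{p_m}{m} \Bigl( 1 - \sum_{k=j}^{m-1} \frac{1}{k} \Bigr) \Bigr] = \frac{j}{\sum_{n=j}^{N} p_n}\, \psi(j), $$

where $\psi(j)$ is the expression within the last brackets. The sign of
$R(j) - S(j)$ is the same as that of $\psi(j)$. Thus, from (1) follows

Proposition 1: A one-step look ahead policy is optimal if for
some $r \ge 1$

$$ \begin{aligned} \psi(j) &\le 0, \qquad j = 1, \ldots, r - 1, \\ \psi(j) &\ge 0, \qquad j = r, \ldots, N. \end{aligned} \tag{2} $$

The hypothesis of Proposition 1 is equivalent to $\varphi(r)$ being
unimodal since, as given in [6],

$$ \varphi(j) - \varphi(j+1) = \psi(j), \qquad j = 1, \ldots, N - 1. $$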
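The sign pattern required by Proposition 1 can be tested mechanically. The sketch below evaluates the bracketed expression $\sum_{m=j}^{N} (p_m/m)\bigl(1 - \sum_{k=j}^{m-1} 1/k\bigr)$ and searches for a cutoff $r$; the names `psi` and `one_step_cutoff` are our own.

```python
from fractions import Fraction


def psi(j, p):
    """sum_{m=j}^{N} (p_m / m) * (1 - sum_{k=j}^{m-1} 1/k), with p[m-1] = p_m."""
    N = len(p)
    return sum(
        (Fraction(p[m - 1]) / m
         * (1 - sum((Fraction(1, k) for k in range(j, m)), Fraction(0)))
         for m in range(j, N + 1)),
        Fraction(0),
    )


def one_step_cutoff(p):
    """Return the smallest r with psi(j) <= 0 for j < r and psi(j) >= 0 for j >= r.

    Returns None when no such r exists, i.e. when the sign pattern fails.
    """
    N = len(p)
    values = [psi(j, p) for j in range(1, N + 1)]
    for r in range(1, N + 1):
        if all(v <= 0 for v in values[:r - 1]) and all(v >= 0 for v in values[r - 1:]):
            return r
    return None
```

For a fixed $M = 3$ the search returns $r = 2$, matching the classical cutoff. For the two-point distribution of Section 2 with $\varepsilon = 1/100$ and $N = 50$ (illustrative values) it returns `None`, because the signs run negative, positive, negative.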


Now let

$$ H_j = p_j - \sum_{m=j+1}^{N} \frac{p_m}{m}, \qquad j = 1, \ldots, N - 1. $$

In terms of $H_j$ we state

Condition (*): $H_j \ge 0$ implies $H_i \ge 0$ for $i > j$.
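Condition (*) says that once $H_j = p_j - \sum_{m=j+1}^{N} p_m/m$ becomes nonnegative it never turns negative again, which is simple to test for a given distribution. The sketch below uses our own helper names `H` and `condition_star`, with `p[m-1]` holding $p_m$.

```python
from fractions import Fraction


def H(j, p):
    """H_j = p_j - sum_{m=j+1}^{N} p_m / m, with p[m-1] = p_m."""
    N = len(p)
    return Fraction(p[j - 1]) - sum(
        (Fraction(p[m - 1]) / m for m in range(j + 1, N + 1)), Fraction(0)
    )


def condition_star(p):
    """True if H_j >= 0 implies H_i >= 0 for all i > j, with i, j in 1, ..., N-1."""
    N = len(p)
    seen_nonnegative = False
    for j in range(1, N):
        if H(j, p) >= 0:
            seen_nonnegative = True
        elif seen_nonnegative:
            return False  # a negative H_i after an earlier nonnegative H_j
    return True
```

The uniform distribution on $1, \ldots, 10$ passes the check, while the two-point distribution of Section 2 (with illustrative $\varepsilon = 1/100$, $N = 50$) fails it, since there $H_1 < 0$, $H_2 > 0$ and $H_3 < 0$.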




We now have

Theorem 1: If condition (*) holds, then there is an $r$ such that
(2) holds (i.e., a one-step look ahead policy is optimal).

Proof: In Rasmussen and Robbins [6] it is given that

$$ \psi(j) - \psi(j+1) = H_j / j, \qquad j = 1, \ldots, N - 1. $$

Let $i$ be any integer such that

$$ \psi(i) \ge 0. $$

If $H_i < 0$, then

$$ \psi(i+1) \ge \psi(i) \ge 0. $$

If $H_i \ge 0$, then by condition (*)

$$ H_j \ge 0, \qquad j = i + 1, \ldots, N - 1, $$

and hence $\psi(j)$ is non-increasing in $j$ for all $j$, $i \le j \le N$. However

$$ \psi(N) \ge 0; $$

therefore, $\psi(j) \ge 0$ for all $j$, $i \le j \le N$. Thus, in both cases
$\psi(i+1) \ge 0$, proving the theorem.
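The difference identity quoted from [6] can also be checked directly. A short derivation, writing $\psi(j) = \sum_{m=j}^{N} (p_m/m)\bigl(1 - \sum_{k=j}^{m-1} 1/k\bigr)$ and $H_j = p_j - \sum_{m=j+1}^{N} p_m/m$, and treating an empty sum as zero:

```latex
\begin{align*}
\psi(j) - \psi(j+1)
  &= \sum_{m=j}^{N} \frac{p_m}{m}\Bigl(1 - \sum_{k=j}^{m-1} \frac{1}{k}\Bigr)
   - \sum_{m=j+1}^{N} \frac{p_m}{m}\Bigl(1 - \sum_{k=j+1}^{m-1} \frac{1}{k}\Bigr) \\
  &= \frac{p_j}{j} - \sum_{m=j+1}^{N} \frac{p_m}{m} \cdot \frac{1}{j} \\
  &= \frac{1}{j}\Bigl(p_j - \sum_{m=j+1}^{N} \frac{p_m}{m}\Bigr)
   = \frac{H_j}{j}.
\end{align*}
```

The $m = j$ term contributes $p_j / j$ because its inner sum is empty, and for each $m > j$ the two inner sums differ only by the single term $1/j$.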

Remarks: Condition (*) is weaker than the condition
$p_1 \le p_2 \le \cdots \le p_N$ given in [6]. It is also satisfied by
the geometric distribution over $1, \ldots, N$, a distribution for which
$\varphi(r)$ is unimodal (as stated in [6]). It is not necessary to
restrict $N$ to being finite; under condition (*) Theorem 1 will hold
with $N = \infty$ since, in this case, $\lim_{j \to \infty} \psi(j) = 0$.
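That a nondecreasing probability function satisfies condition (*) can be seen from a one-line computation. Writing $H_j = p_j - \sum_{m=j+1}^{N} p_m/m$, for $j = 1, \ldots, N - 2$:

```latex
\begin{align*}
H_{j+1} - H_j
  &= \Bigl(p_{j+1} - \sum_{m=j+2}^{N} \frac{p_m}{m}\Bigr)
   - \Bigl(p_j - \sum_{m=j+1}^{N} \frac{p_m}{m}\Bigr) \\
  &= (p_{j+1} - p_j) + \frac{p_{j+1}}{j+1}.
\end{align*}
```

When $p_{j+1} \ge p_j$ the right side is nonnegative, so $H_j$ is nondecreasing in $j$ and can never become negative after being nonnegative. The converse fails: a decreasing sequence such as the geometric one can still satisfy condition (*).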